Segmentation of Nastaliq Script for OCR

نویسندگان

  • Sohail A. Sattar
  • Shamsul Haque
  • Mahmood K. Pathan
چکیده

In this paper we have presented a novel segmentation technique for the implementation of an OCR (Optical Character Recognition) for printed Nastalique text, a calligraphic style of Urdu which uses the Arabic script for its writing. OCR for many of the world major languages have been developed and are being used but at present an OCR for Nastalique is not available and the published research on Nastalique OCR, Urdu OCR or even on any area of Urdu computing is almost non-existent, the reason being the challenges that the Nastalique style poses for its optical recognition. We used Matlab 7 for our experimentation the results are reported in this paper which are very encouraging.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Survey on Script Segmentation for Bangla OCR

Script segmentation is an important primary task for any Optical Character Recognition (OCR) software. Especially, in case of off-line OCR for printed character, it has more importance. Through script segmentation a big image of some written document is fragmented into a number of small pieces which are then used for pattern matching to determine the expected sequence of characters. In the impl...

متن کامل

Optical Character Recognition System for Urdu Words in Nastaliq Font

Optical Character Recognition (OCR) has been an attractive research area for the last three decades and mature OCR systems reporting near to 100% recognition rates are available for many scripts/languages today. Despite these developments, research on recognition of text in many languages is still in its early days, Urdu being one of them. The limited existing literature on Urdu OCR is either l...

متن کامل

A Hybrid Approach to Classify Gurmukhi Script Characters

Researchers have worked extensively on OCR, in the past few decades. This is also visible from the fact that various types of OCR are available in the market. Out of these available OCR’s majority is to support foreign languages. In Indian context, majority of available OCR’s are for Hindi and Bangla, but a very few reports are available on Gurmukhi script which is used to write Punjabi languag...

متن کامل

Comparative study of the paper inscriptions available in the museum of Abdolazim holy shrine and the tile inscriptions located in the veranda of Imamzadeh Taher’s shrine, by Mohammd Ebrahim Tehrani applying Nastaliq script

The Shrine of Abdolazim located in Ancient historical city of Ray is a complex including holy shrines of Immamzadeh Taher, Immamzadeh Hamzeh and Abdolazim Hassni. Immamzadeh Taher’s shrine is placed at the north side of the complex and at the east side of Abdolazim Hassni’s shrine and the museum of the complex also is at the south east of Mosalla (praying room for muslims) which is ornamented w...

متن کامل

Segmentation of Handwritten Documents Containing Kannada Script

Segmentation is one of the important phases of Optical Character Recognition (OCR) system, which extracts objects of interest from an image. Feature extraction and classification phases of OCR will be more effective, if the techniques selected for segmentation is effective. This paper focuses on to develop a system for handwritten documents containing Kannada script and proposes suitable techni...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009